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AMENDMENTS TO THE CLAIMS: 

1-48 (Cancelled) 

49. (Currently Amended) A machine implemented method for providing statistics 
characterizing content for language translation, comprising the steps of: 

accessing remote content in a first language through the Interne t , including content 
retrieved by crawling a web site via following links to additional pages; 

parsing the content into one or more translatable components; and 

determining whether there exists at least one of the translatable components that does not 
have a corresponding translated component; and 

generating statistics based only on the at least one on e or more translatable 
component[[s.]] that does not have a corresponding translated component in order to size the 
content for language translation. 

50. (Canceled) 

5 1 . (Previously presented) The method according to claim 50, wherein the language 
translation includes human translating the one or more translatable components. 

52. (Previously presented) The method according to claim 49, further comprising the 
step of adding the one or more translatable components to a translation list for translation into a 
second language. 
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53. (Previously presented) The method according to claim 52, wherein: 

the first language includes one of English, French, Spanish, German, Portuguese, Italian, 
Chinese, Korean, and Arabic; 

the second language includes one of English, French, Spanish, German, Portuguese, 
Italian, Japanese, Chinese, Korean, and Arabic; and 

the second language is different from the first language. 

54. (Previously presented) The method according to claim 49, wherein the step of 
accessing includes retrieving the content in the first language from a source in a first language. 

55. (Previously presented) The method according to claim 49, wherein each of the 
translatable components is one of: 

a text segment; 
an image file; 
an audio clip; 
a video clip; 
a file; and 

any combination thereof in an electronic data stream. 

56. (Previously presented) The method according to claim 49, further comprising the 
step of generating an identifier for each of the translatable components so that each of the 
translatable components is accessible via a corresponding identifier. 

57. (Previously presented) The method according to claim 56, wherein the identifier for a 
text segment is generated using at least one of a hash code, a checksum, and a mathematical 
algorithm based on one or more text segments. 
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58. (Previously presented) The method according to claim 49, wherein the statistics 
includes at least one of a file count, a page count, a text segment count, a unique text segment 
count, a word count, and a unique word count. 

59. (Previously presented) The method according to claim 49, wherein the step of 
parsing is performed based on at least one markup tag contained in the content in the first 
language. 

60. (Currently Amended) A machine implemented method for providing statistics 
characterizing content, comprising the steps of: 

accessing remote content in a first language throug h the Internet including content 
retrieved by crawling a web site via following links to additional pages; 
parsing the content into one or more components; [[and]] 

determining whether there exists at least one of the translatable components that does not 
have a corresponding translated component; and 

generating statistics based only on the at least one translatable tho ono or mor e 
components.]] that does not have a corresponding translated comp onent in order to size the 
content for language translation. 

61. (Currently Amended) A system for providing statistics characterizing content for 
language translation, comprising: 

a content accessing unit configured for accessing remote content in a first language 
through the Interne t , including content retrieved by crawling a web site via following links to 
additional pages; 
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an information processing unit configured for parsing the content into one or more 
translatable components; [[and]] 

a determining unit configured for determining whether t here exists at least one of the 
translatable components that does not have a corre sponding translated component; and 

a statistics generation unit configured for generating statistics based only on the at least 
one one or more translatable components.]] that does not have a corre sponding translated 
component in order to size the content for langu age translation. 

62. (Canceled) 

63. (Previously presented) The system according to claim 62, wherein the language 
translation includes human translating the one or more translatable components. 

64. (Previously Presented) The system according to claim 61, wherein the content 
accessing unit is further configured for facilitating retrieval of content from a data source in the 
first language. 

65. (Previously presented) The system according to claim 61, wherein each of the 
translatable components is one of: 

a text segment; 
an image file; 
an audio clip; 
a video clip; 
a file; and 

any combination thereof in an electronic data stream. 
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66. (Previously presented) The system according to claim 61, wherein the statistics 
includes at least one of a file count, a page count, a text segment count, a unique text segment 
count, a word count, and a unique word count. 

67. (Previously presented) The system according to claim 61, wherein the first language 
includes one of English, French, Spanish, German, Portuguese, Italian, Chinese, Korean, and 
Arabic. 

68. (Previously Presented) The system according to claim 61, wherein the information 
processing unit parses the content based on at least one markup tag contained in the content. 

69. (Previously presented) The system according to claim 61, further comprising a 
translation list to which the one or more components are added for translation. 

70. (Currently Amended) A machine readable medium having data stored thereon, the 
data, when read, causing the machine to perform the following: 

accessing remote content in a first language through the Internet , including content 
retrieved by crawling a web site via following links to additional pages; 

parsing the content into one or more translatable components; [[and]] 

determining whether there exists at least one of the translatable com ponents that does not 
have a corresponding translated component; and 

generating statistics based only on the at least one ono or mor e translatable 
component^.]] that does not have a corresponding translated component in order to size the 
content for language translation. 

71. (Canceled) 
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72. (Previously presented) The medium according to claim 71, wherein the language 
translation includes human translating the one or more translatable components. 

73. (Previously presented) The medium according to claim 70, wherein the statistics 
includes at least one of a file count, a page count, a text segment count, a unique text segment 
count, a word count, and a unique word count. 

74. (Currently Amended) A machine implemented method for providing statistics 
characterizing content for language translation, comprising the steps of: 



accessing remote content in a first language through the Internet , including content 

retrieved by crawling a web site via following links to additional pages; 

parsing the content into one or more translatable components based on markup tags 
contained therein; [[and]] 

determining whether there exists at least one of the translatable components that does not 
have a corresponding translated component; and 

generating statistics based only on the at least one one or mor e translatable 
components,]] that does not have a corresponding translated component in order to size the 
content for language translation, wherein the at least one translatable components are]] is to be 
human translated. 

75. (Previously presented) The method according to claim 74, further comprising the 
steps of: 

obtaining tracking information corresponding to a source from where the content is 
retrieved; and 
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utilizing the tracking information when additional content is to be retrieved from the 

source. 

76. (Previously presented) The method according to claim 75, wherein the tracking 
information includes at least one of state information or session information. 

77. (Previously presented) The method according to claim 76, wherein the tracking 
information is obtained via a cookie. 

78. (Previously Presented) The method according to claim 74, wherein when the content 
contains an electronic form, the method further comprising the steps of: 

populating the electronic form automatically with pre-defined information; and 
submitting the populated electronic form to the source. 

79. (Previously presented) The method according to claim 74, wherein the source from 
where the content is retrieved is a web site. 

80. (Previously presented) The method according to claim 74, further comprising: 
identifying an image file associated with the content in the first language as a translatable 

component; 

generating statistics for the image file. 

81. (Previously presented) The method according to claim 80, wherein the statistics for 
the image file is generated manually. 

82. (Previously presented) The method according to claim 80, wherein the statistics for 
the image file is generated automatically. 
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83. (Currently Amended) A machine implemented method for providing statistics 
characterizing content for language translation, comprising the steps of: 

accessing remote content in a first language through the Internet , including content 
retrieved by crawling a web site via following links to additional pages; 

parsing the content into one or more translatable components; [[and]] 

determining whether there exists at least one of the translatable components that does not 
have a corresponding translated component; and 

generating statistics based only on the at least one one or more translatable 
components,]] that does not have a corresponding translated comp onent in order to size the 
content for language translation, wherein the statistics are used to measure the size the content 
for language translation. 

84. (Currently Amended) A system for providing statistics characterizing content for 
language translation, comprising: 

a content accessing unit configured for accessing remote content in a first language 
through the Internet , including content r e trieved by crawling a web site via following links to 
additional pages; 

an information processing unit configured for parsing the content into one or more 
translatable components; [[and]] 

a determining unit configured for determining whether there exists at least one of the 
translatable components that does not have a corresponding translated component; and 

a statistics generation unit configured for generating statistics based only on the at lest 
one one or more translatable components]] that does not have a corresponding translated 
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component in order to size the content for language translation , wherein the statistics are used to 
measure the size the content for language translation. 

85. (Currently Amended) A machine readable medium having data stored thereon, the 
data, when read, causing the machine to perform the following: 

accessing remote content in a first language through the Internet, including content 
retrieved by crawling a web site via following links to additional pages; 

parsing the content into one or more translatable components; [[and]] 

determining whether there exists at least one of the translatable component s that does not 
have a corresponding translated component; and 

generating statistics based only on the at least one one or more translatable 
components]] that does not have a corresponding translated compon ent in order to size the 
content for language translation , wherein the statistics are used to measure the size the content 
for language translation. 
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